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(54) A method and a device for recognising speech 



(57) According to the invention, speech recognition 
can be limited to a smaller number making use of com- 
mands (16, 17) given by a user. By means of a method 
and device, according to the invention, it is also possible 
to activate a speech recognition device by making use, 
in the activation, of an existing keyboard or a touch-sen- 
sitive screen of the device. A method, according to the 
invention, provides the user with a logical way to activate 
the speech recognition device in the device simultane- 
ously providing an improved capacity of the speech rec- 
ognition device. According to the invention, it isalscTpos- 



sible to carry out the speech recognition process by 
making use of commands given by the user irrespective 
of how the speech recognition device itself is activated. 
According to an embodiment of the invention, the device 
comprises a touch-sensitive screen or surface, in which 
case the information about a single letter or several let- 
ters written on the screen is transmitted to the speech 
recognition device, whereupon speech recognition is 
limited to words, wherein the letters in question occur. 
Speech recognition is most preferably limited to a name 
beginning with the letter written on the touch screen by 
the user. 
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Description 

[0001] The present invention relates to a method for 
recognising speech and a device that utilises the speech 
recognition method according to the invention. s 
[0002] Normally, in mobile telephones, it is possible 
to browse through a telephone notepad to select a name 
by making use of the first letter of the name searched 
for. In this case, when a user during the search presses, 
e.g. the letter "s", the names beginning with the letter io 
"s" are retrieved from a memory. Thus, the user can 
more quickly find the name he/she is looking for without 
needing to browse through the content of the notepad 
in alphabetical order in order to find the name. This kind 
of method is fully manual and is based on the commands 15 
given by the user through a keyboard and the browsing 
of a memory based on this. 

[0003] Today, there are also some mobile stations that 
utilise speech recognition devices, wherein a user can 
give a command by voice. In these devices, the speech 20 
recognition device is often speaker-dependent; i.e. the 
operation of the speech recognition device is based on 
that the user teaches the speech recognition device 
words that the speech recognition device is supposed 
to later recognise. There are also so-called speaker-in- 25 
dependent speech recognition devices for which no 
separate training phase is required. In this case, the op- 
eration of the speech recognition device is based on a 
large amount of teaching material compiled from a large 
sampling of different types of speakers. Moderate oper- 30 
ation in case of a so-called average user is typical of a 
speaker-independent recognition device. Correspond- 
ingly, a speaker-dependent speech recognition device 
operates best for the person who has trained the speech 
recognition device. 35 
[0004] It is typical of both speech recognition devices 
mentioned above that the performance of the speech 
recognition device greatly depends on how large a vo- 
cabulary is used. It is also typical of speech recognition 
devices according to prior art that they are limited to a 40 
specific number of words, which the speech recognition 
device is capable of recognising. For example, in mobile 
stations, a user is provided with a maximum of 20 
names, which he/she can store in a notepad within the 
telephone by voice and, correspondingly, use these 45 
stored names in connection with voice selection. It is 
quite obvious that such a number is not sufficient in 
present or future applications, where the objective is to 
substantially increase the number of words to be recog- 
nised. As the number of words to be recognised increas- so 
es, e.g. ten-fold, with current methods, it is not possible 
to maintain the same speech recognition capacity as 
when using a smaller vocabulary. Another limiting factor, 
e.g. in terminal equipment, is the need for a memory to 
be used, which naturally increases as the vocabulary of 55 
the speech recognition device expands. 
[0005] In current speech recognition devices accord- 
ing to prior art, the activation of a speech recognition 
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device can be implemented by voice using a specific ac- 
tivation command, such as e.g. "ACTIVATE", whereup- 
on the speech recognition device is activated and is 
ready to receive commands from a user. A speech rec- 
ognition device can also be activated with a separate 
key. It is typical of speech recognition devices activated 
by voice that the performance of the activation is de- 
pendent on the noise level of the surroundings. Also dur- 
ing the operation of the speech recognition device, the 
noise level of the surroundings greatly affects the per- 
formance of the speech recognition device to be 
achieved. It can be said that critical parameters for the 
performance of a speech recognition device are the ex- 
tent of the vocabulary and the noise conditions of the 
surroundings. 

[0006] A further known speech recognition system is 
disclosed in US 4,866,778 where a user can select a 
sub-vocabulary of words by selecting an initial string of 
one or more letters causing the recognition to be per- 
formed against the sub-vocabulary restricted to words 
starting with those initial letters. 

[0007] Now, we have invented a method and a device 
for recognising speech the objective of which is to avoid 
or, at least, to mitigate the above-mentioned problems 
of prior art. The present invention relates to a device and 
a method, wherein a user is allowed to give, during 
speech recognition, a qualifier by means of which 
speech recognition is only limited to those speech mod- 
els that correspond with the qualifier provided by the us- 
er. In this case, only a specific sub-set to be used during 
speech recognition is selected from the prestored 
speech models. 

[0008] According to an embodiment of the invention, 
a speech recognition device is activated at the same 
time as a qualifier that limits speech recognition is pro- 
vided by touching the device making use of the existing 
keyboard or touch-sensitive screen/base of the device. 
The activation is most preferably implemented with a 
key. A method according to the invention provides a user 
with a logical way to activate the speech recognition de- 
vice therein at the same time providing an improved per- 
formance of the speech recognition device along with 
the entered qualifier. The limitation of speech recogni- 
tion according to the invention can also be implemented 
apart from the activation of the speech recognition de- 
vice. 

[0009] According to an exemplary embodiment of the 
invention, the device comprises a touch-sensitive 
screen or surface (base), whereupon the information 
about the character or several characters written on the 
screen is transmitted to the speech recognition device, 
in which case speech recognition is limited to words 
wherein the characters in question occur. Speech rec- 
ognition is most preferably limited to a name beginning 
with the character written by the user on the touch 
screen. 

[0010] According to an exemplary embodiment of the 
invention, speech recognition can also be implemented 



EP 0 961 263 A2 



2 



BNSDOCID:<EP 0961 263 A2 I > 



3 EP 0 961 

by making use in advance of all the stored models and 
by utilising the limiting qualifier provided by the user 
when defining the final recognition result. 
[0011] According to a first aspect of the invention 
there is provided a method for recognising an utterance s 
of a user with a device, wherein a set of models of the 
utterances have.been stored in advance and for speech 
recognition, the utterance of the user is received, the 
utterance of the user is compared with the prestored 
models and, on the basis of the comparison, a recogni- 10 
tion decision is made, the method being characterised 
in that, 

the user is allowed to provide a qualifier limiting the 
comparison by touching the device, the qualifier is 
itentifying an item in a men u structure of the device, 
a sub-set of models is selected from the stored 
models on the basis of the qualifier provided by the 
user said sub-set of models identifying sub-items of 
the menu structure, and 20 
a comparison is made for making the recognition 
decision by comparing the utterance of the user with 
said sub-set of models. 

[0012] According to a second aspect of the invention 25 
there is provided a method for recognising an utterance 
of a user with a device, wherein a set of models of the 
utterances have been stored in advance and for speech 
recognition, the utterance of the user is received, the 
utterance of the user is compared with the prestored 30 
models and, on the basis of the comparison, a recogni- 
tion decision is made, the method being characterised 
in that, 

a comparison is made for making a first recognition 35 
decision by comparing the utterance of the user with 
the prestored models, 

the user is allowed to provide a qualifier limiting the 
comparison by touching the device for selecting a 
sub-set of models, the qualifier itentifying an item in 40 
a menu structure of the device and said sub-set of 
models identifies sub-items of the menu structure, 
a final comparison is made for making the recogni- 
tion decision by comparing the first recognition de- 
cision with said sub-set of models. 45 

[0013] According to a third aspect of the invention 
there is provided a device comprising a speech recog- 
nition davtca for recognising -the utterance of a usac, 
memory means for storing speech models, and means sv 
for receiving the utterance of the user, comparison 
means for carrying out the recognition process by com- 
paring the utterance of the user with the models stored 
in the memory means, the device being characterised 
in that the device also comprises means for receiving a ss 
qualifier from the user by touching the device, means 
for selecting a set from the stored models on the basis 
of the qualifier received from the user for limiting the 
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comparison made by the comparison means to said set 
of models and means for storing a menu structure of a 
device and for identifying the received qualifier as an 
item in a menu structure of the device. 

Figure 1 shows the structure of a speech recognition 
device, according to prior art, as a block di- 
agram, 

Figure 2 shows the structure of a speech recognition 
device, according to the invention, as a 
block diagram, 

Figure 3 shows the operation of a method, accord- 
ing to the invention, as a flowchart, 

Figure 4 shows the operation of another method, ac- 
cording to the invention, as a flowchart, and 

Figure 5 shows the structure of a mobile station uti- 
lising a method according to the invention. 

[0014] Figure 1 shows the block diagram structure of 
a known speech recognition device as applicable to the 
present invention. Typically, the operation of the speech 
recognition device is divided into two different main ac- 
tivities: an actual speech recognition phase 10-12, 
14-15 and a speech training phase 10-13, as shown in 
Figure 1 . The speech recognition device receives from 
a microphone as its input a speech signal S(n), which is 
converted into a digital form by an A/D converter 1 0 us- 
ing, e.g. a sampling frequency of 8 kHz and a 1 2-bit res- 
olution per sample. Typically, the speech recognition de- 
vice comprises a so-called front -end 11, wherein the 
speech signal is analysed and a feature vector 12 is 
modelled, the feature vector describing the speech sig- 
nal during a specific period. The feature vector is de- 
fined, e.g. at 10 ms intervals. The feature vector can be 
modelled using several different techniques. For exam- 
ple, different kinds of techniques for modelling a feature 
vector have been presented in the reference: J. Picone, 
"Signal modeling techniques in speech recognition", 
IEEE Proceedings, Vol. 81, No. 9, pp. 1215-1247, Sep- 
tember 1993. During the training phase, models are 
constructed by means of the feature vector 1 2, in a train- 
ing block 13 of the speech recognition device, for the 
words used by the speech recognition device. In model 
training 13a, a model is defined for the word to be rec- 
ognised. In the training phase, a repetition of the word 
to be modelled can be utilised. The models are stored 
in a memory 1 3b. During speech recognition, the feature 
vector 12 is transmitted to an actual recognition device 
14, which compares, in a block 15a, the models con- 
structed during the training phase with the feature vec- 
tors to be constructed of the speech to be recognised, 
and the decision on the recognition result is made in a 
block 15b. The recognition result 15 denotes the word, 
stored in the memory of the speech recognition device, 



3 



BNSDOCID: <EP 0961 263 A2J_> 



5 



EP 0 961 263 A2 



6 



that best corresponds with the word uttered by a person 
using the speech recognition device. 
[0015] Figure 2 shows the operation of a speech rec- 
ognition device according to the invention where, in ad- 
dition to the solution according to Figure 1, the speech 
recognition device comprises a block 16, wherein the 
selection of the models is carried out on the basis of the 
commands given by a user e.g. through a keyboard. 
The block 16 receives as its input a signal 17 containing 
the information on which key the user has pressed. In 
the block 16, speech models 18, transmitted by the 
block 13b, are compared with the signal 17 and a sub- 
set 19 is selected from these and transmitted to the 
block 15a of the speech recognition device. The selec- 
tion of the models relating to the operation of the block 
16 has been described below making use of a memory 
structure according to the present invention. 

Table 1 



Name 


Number 


Reference Model 


Smith Paul 


0405459883 


XXX... X 








Table 2 





Menu 








Phone Settings 






Messages 








Read messages 






Write messages 




Memory Functions 





[0016] Table 1 shows a memory structure according 
to the invention, which may form, e.g. a phone notepad 
of a mobile station or part of it. The memory comprises 
the name of a person, a telephone number that corre- 
sponds with the name, as well as a reference model (e. 
g. a feature vector) constructed during the speech rec- 
ognition training phase. The table shows as an example 
one line of the table, on which a person's name "Smith", 
a corresponding telephone number "0405459883", as 
well as a data field containing a reference model "xxx .. 
x" are stored. The length of the reference model is a 
speech recognition device-specific parameter and, 
therefore, the field length depends on the speech rec- 
ognition device used. According to the invention, when 
a user presses a specific key of the device, e.g. the key 
"s", the processor of the device goes through the content 
of the memory and compares the content of the data 
field containing the name and only retrieves from the 
memory the names that begin with the letter "s". The 
comparison can be made, e.g. by comparing the ASCII 
character of the pressed key with the ASCII character 
of the first letter of the name in the memory and by se- 



lecting the reference model that corresponds with the 
name provided that the letters correspond with one an- 
other in the comparison. The information on the selected 
reference models (sub-set) is then transmitted to the 
s speech recognition device, after which the speech rec- 
ognition device carries out speech recognition using the 
models that relate to the names selected above. 
[0017] The user can further also press another key, e. 
g. the key "m", whereupon speech recognition is limited 
10 to the names beginning with the letter combination 
"Sm°. In this case, the number of names to be recog- 
nised can be further limited, i.e. the sub-set of models 
decreases. In addition, it is also possible that the mem- 
ory contains fields other than the above-mentioned 
is name field on the basis of which the speech recognition 
device is activated according to the invention. The tele- 
phone memory of a device, such as a mobile station, 
may contain, e.g. a field that indicates whether a specific 
number is a mobile station number or not. In this case, 
20 the memory field may contain, e.g. an element "GSM", 
whereupon when the user activates this field only the 
GSM numbers are selected and not the others, e.g. the 
numbers of a fixed network or fax numbers. Thus, the 
invention is not limited to a case wherein the letter se- 
25 lected by the user controls the operation of the speech 
recognition device but, instead, the user may select 
. names, e.g. from a telephone notepad according to 
some other classification. For example, the names in a 
telephone notepad may have been divided into classes 
30 like "Home", "Office", "Friends", in which case the mo- 
bile station may provide a convenient way to select from 
the menu, e.g. the class "Friends", whereupon speech 
recognition is directed to the names in this class, ac- 
cording to the invention. It is also possible that the mo- 
ss bile station comprises a keyboard, wherein several dif- 
ferent characters are combined in a specific key. For ex- 
ample, the letter symbols "j, k, I" may be included in the 
numeric key "5". In this case, the invention can be ap- 
plied so that when the user presses the key "5", the 
40 speech recogn it ion device is activated so that, in speech 
recognition, it is limited to names beginning with the let- 
ters "j", "k" or "I". In an exemplary embodiment of the 
invention, when the user presses the key SEND speech 
recognition, according to the invention, can be limited, 
45 e.g. to the last made calls (e.g. the last 10 calls). In this 
case, a call can be commenced, e.g. by pressing and 
holding the key SEND, while the user simultaneously 
pronounces the name to be recognised, whereupon 
speech recognition is limited to a set of models contain- 
so ing the name/symbol of the last 10 calls. 

[0018] The speech recognition device is most prefer- 
ably activated by a press-and-hold, whereupon the de- 
vice (speech recognition device) is informed by the 
pressing and holding of the key in question that you want 
55 speech recognition to commence. At the same time, the 
information about the pressed key is transmitted to the 
speech recognition device, i.e. speech recognition is 
limited, e.g. to words beginning with the letter on the key, 
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whereupon only the reference models that the user 
wants are activated. It is also according to the invention 
that the speech recognition device is activated in a way 
other than by pressing a key, for example, by voice. In 
this case, after the activation of the speech recognition 
device, it is possible to use, during speech recognition, 
the reference model selection according to the invention 
as presented above. 

[0019] An arrangement, according to the invention, 
can also be made for the menu structure of a mobile 
station, as shown in Table 2. Table 2 shows a specific 
part of the menu structure of a telephone. In this exam- 
ple, the main menu consists of the menus "Telephone 
Settings", "Messages" and "Memory Functions". Corre- 
spondingly, the menu "Messages" consists of the sub- 
menus "Read messages" and "Write messages". When 
a user of the telephone selects a men u function by voice 
or pressing a menu key, activation is limited to the points 
in the menu. In the example, activation by voice is di- 
rected to the menus "Telephone Settings", Messages" 
or "Memory Functions". The user can further manually 
select the submenu "Messages", in which case activa- 
tion by voice is directed to the points "Read messages" 
or "Write messages" of the menu in question. The 
above-mentioned method can also be applied to exte- 
rnal services for a mobile station and their activation. In 
this case, a specific key of the mobile station is defined 
for a specific service, e.g. for a WWW service (World 
Wide Web). In this case, the pressing and holding of the 
key in question enables, e.g. the selection of a book- 
mark of WWW addresses by using a voice command. 
In this application, the mobile station contains a table of 
letter symbols, which are selected as described above. 
[0020] Figure 3 shows the activity sequence of a 
method according to the invention. In Phase 30, it is dis- 
covered whether or not a user has carried out the press- 
and-hold that activates the speech recognition device. 
If no press-and-hold is discovered, the device remains 
in a state where the activation of the speech recognition 
device is being anticipated. Alternatively, the speech 
recognition device can be activated at the same time, 
when the user starts writing on a touch-sensitive sur- 
face, such as a screen. The activation of the speech rec- 
ognition device may also be based on voice. In Phase 
31 , the letter/text written on the touch screen is recog- 
nised. In Phase 32, the information about the pressing 
of the key is transmitted to the speech recognition de- 
vice and/or the information about the alphanumeric 
character written or drawn by the user on the touch 
screen is transmitted. It is also possible to draw, on the 
touch screen, some other figure that deviates from an 
alphanumeric character, which is utilised in speech rec- 
ognition. In Phase 33, it is examined whether or not the 
user is still carrying out the pressing of keys or writing 
on the touch screen, in which case the information about 
these activities is also transmitted to the speech recog- 
nition device. This can be done by comparing the activ- 
ities of the user with a specific time threshold value by 



means of which it is decided whether or not the user has 
concluded the giving of commands. In Phase 34, the 
word pronounced by the user is recognised by making 
use of the information provided in Phase 32. 
5 [0021] Figure 4 shows another activity sequence of a 
method according to the invention. In this method, the 
pronounced word is first recognised traditionally and, 
only after this, the limitation provided by the user is uti- 
lised to limit the result obtained during the recognition 
10 phase. In Figure 4, Phases 30-33 correspond with the 
corresponding phases in Figure 3. In Phase 35, the ut- 
terance of the user is recognised by making use of all 
the prestored models. The information about this recog- 
nition result is transmitted to Phase 34, wherein the final 
15 recognition decision is made by comparing the first rec- 
ognition decision with said sub-set of models, which has 
been obtained on the basis of the limitation provided by 
the user. The recognition decision obtained from Phase 
35 contains a set of proposed words that have been rec- 

20 ognised and the recognition probabilities corresponding 
with the words, which are transmitted to Phase 34. In 
case of a faulty recognition, the word that has got the 
highest recognition probability is not the word pro- 
nounced by the user. In this case, in Phase 34 according 

25 to the invention, it is possible to carry out the final 
speech recognition phase by means of the qualifier pro- 
vided by the user and reach a higher speech recognition 
performance according to the invention. A method ac- 
cording to the invention may also operate so that the 

30 giving of a limitation and the recognition of a pronounced 
word are essentially simultaneous activities. 
[0022] Figure 5 shows the structure of a mobile sta- 
tion, which has a speech recognition device 66 that uti- 
lises the present invention. The mobile station compris- 
es es parts typical of the device, such as a microphone 61 , 
a keyboard 62, a screen 63, a speaker 64, and a control 
block 65, which controls the operation of the mobile sta- 
tion. According to an embodiment of the invention, the 
screen 63 may be a touch-sensitive surface, such as a 

40 screen. In addition, the figure illustrates transmitter and 
receiver blocks 67, 68 typical of a mobile station. The 
control block 65 also controls the operation of the 
speech recognition device 66 in connection with the mo- 
bile station. When the speech recognition device is be- 

45 ing activated either during the speech recognition de- 
vice's training phase or during the actual speech recog- 
nition phase, the voice commands given by the user are 
transmitted, controlled by the control block, from the mi- 
crophone 61 to the speech recognition device 66. Ac- 

50 cording to the invention, the control block 65 transmits 
to the speech recognition device 66 the information 
about the commands given by the user through keys or 
about the alphanumeric character/figure entered on to 
the touch screen. The voice commands can also be 

55 transmitted through a separate HF (hands free) micro- 
phone. The speech recognition device is typically imple- 
mented by means of DSP and it comprises external and/ 
or internal ROM/RAM circuits 69 necessary for its oper- 
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at ion. 

[0023] An embodiment of the present invention may 
comprise a device, e.g. a mobile station, which has a 
touch-sensitive surface, such as a touch-sensitive 
screen or base. In this case, a user writes the first letter 
of the word to be recognised on the touch-sensitive sur- 
face, e.g. with a pen or draws with a finger and simulta- 
neously pronounces the word to be recognised (alter- 
natively, the user presses the point of the letter dis- 
played on the screen). In this case, the information 
about the provided letter is transmitted to the speech 
recognition device and speech recognition is limited to 
words in which the letter in question occurs. Recognition 
is most preferably limited to words that begin with the 
initial in question as described above. In this case, the 
user may write, according to the invention, on the touch- 
sensitive surface, e.g. the letter "S B and simultaneously 
pronounce the name to be recognised, e.g. "Smith", 
whereupon speech recognition is limited to names be- 
ginning with the letter "S". 

[0024] Alternatively, the user may first write a letter on 
the touch screen and, after this, pronounce the word to 
be recognised. The above-mentioned method based on 
keys and writing on a touch-sensitive surface can also 
be combined, in which case the user can both write on 
a touch-sensitive surface and press some key and uti- 
lise both of these data in speech recognition. The touch- 
sensitive surface in itself is outside this invention, and it 
can be implemented in various ways according to prior 
art. 

[0025] It can be estimated that with a method, accord- 
ing to the present invention, a recognition accuracy, 
which is 10-30-fold compared with recognition devices 
according to prior art can be achieved if the number of 
names to be recognised remains the same. On the other 
hand, by means of the invention, it is possible to recog- 
nise according to the invention 10-30 times as many 
names can be recognised while the recognition accura- 
cy remains unchanged. This improved capacity is based 
on a combination, according to the invention, whereup- 
on commands given by the user through keys/a touch- 
sensitive surface, i.e. qualifiers limiting speech recogni- 
tion search, are combined with speech recognition. One 
exemplary embodiment of the invention was based on 
the utilisation of a touch screen. An advantage of this 
application is that the algorithms used in text recognition 
and speech recognition are almost identical, whereupon 
the amount of program memory required does not in- 
crease much in adavica, where both of these functions 
are implemented. 

[0026] Above we described a mobile station as an ex- 
emplary embodiment of the present invention. However, 
the invention can equally well be applied, for example, 
to computers. The present invention is not limited to the 
embodiments presented above, and it can be modified 
within the framework of the enclosed claims. 



Claims 

1 . A method for recognising an utterance of a user with 
a device, wherein a set of models of the utterances 

5 have been stored in advance and for speech rec- 

ognition, the utterance of the user is received, the 
utterance of the user is compared with the prestored 
models and, on the basis of the comparison, a rec- 
ognition decision is made, characterised in that, 

io in the method, 



the user is allowed to provide a qualifier limiting 
the comparison by touching the device, the 
qualifier itentifying an item in a menu structure 
of the device, 

a sub-set of models is selected from the stored 
models on the basis of the qualifier provided by 
the user said sub-set of models identifying sub- 
items of the menu structure, and 
a comparison is made for making the recogni- 
tion decision by comparing the utterance of the 
user with said sub-set of models. 



15 



20 



2. A method for recognising an utterance of a user with 
a device, wherein a set of models of the utterances 
have been stored in advance and for speech rec- 
ognition, the utterance of the user is received, the 
utterance of the user is compared with the prestored 
models and, on the basis of the comparison, a rec- 
ognition decision is made, characterised in that, 
in the method, 

a comparison is made for making a first recog- 
nition decision by comparing the utterance of 
the user with the prestored models, 
the user is allowed to provide a qualifier limiting 
the comparison by touching the device for se- 
lecting a sub-set of models, the qualifier itenti- 
fying an item in a menu structure of the device 
and said sub-set of models identifies sub-items 
of the menu structure, 

a final comparison is made for making the rec- 
ognition decision by comparing the first recog- 
nition decision with said sub-set of models. 

3. A method according to claim 1 or 2, characterised 
in that a speech recognition device is activated in 
response to the qualifier provided by the user. 



50 4. A method according to claim 1 or 2, characterised 
in that the user is allowed to give said qualifier by 
pressing a key. 



25 



30 



35 



40 



45 



55 



A method according to claim 1 or 2, characterised 
in that the user is allowed to provide said qualifier 
by writing an alphanumeric character on a touch- 
sensitive surface of the device. 
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6. A method according to claim 3 or 4, characterised 
in that the user is allowed to provide said qualifier 
as a press-and-hold. 

7. A device comprising a speech recognition device 
(66) for recognising the utterance of a user, memory 
means (69) for storing (13b) speech models, and 
means (61 ) for receiving the utterance of the user, 
comparison means (19, 15a, 15b) for carrying out 
the recognition process by comparing the utterance 
of the user with the models stored in the memory 
means, characterised in that the device also com- 
prises means (62, 63) for receiving a qualifier (17) 
from the user by touching the device, means (16) 
for selecting a set from the stored models on the 
basis of the qualifier received from the user for lim- 
iting the comparison made by the comparison 
means (19, 15a, 15b) to said set of models and 
means (65) for storing a menu structure of a device 
and for identifying the received qualifier as an item 
in a menu structure of the device. 

8. A device according to claim 7, characterised in that 
the means for receiving the qualifier from the user 
comprise a keyboard. 25 

9. A device according to claim 7, characterised in that 
the means for receiving the qualifier comprise a 
touch-sensitive surface. 

30 

1 0. A device according to claim 7, characterised in that 
it comprises means (62, 63, 65) for activating the 
speech recognition device in response to the qual- 
ifier received from the user. 

35 



40 



45 



50 



55 



7 

BNSDOCID: <EP 0961 263 A2_l_> 



10 



15 




BNSDOCID: <EP 



0961 263 A2 I > 



EP 0 961 263 A2 




BNSDOCID: <EP 0961263A2J_> 




BNSDOCID: <EP 096 12 63 A2 I > 



10 



EP 0 961 263 A2 



CO 



3 

CD 




BNSDOCID: <EP 0961 263 A2_l_> 



11 



EP 0 961 263 A2 




